A variety of slightly curated youtube videos on the topic of AI.
Author: Jonathan Hendler
Sneak peek at early decisions at HAI
Recently launched a site with some basic vision behind HAI.
But the tech stack is where the rubber meets the road. I’ve been coding about two months now. At the very beginning I went through a fair amount of thinking and ended up selecting a language for the backend based on a number of factors. From languages I knew, C++, Go, PHP, Python, Java/Scala, and Node.js were on the table. Python and Java were the two top contenders, but I ended up going with Python.
So far I’ve been really happy with Python for both flexibility of the language, the available libraries for both web and machine learning, and the developer community. Ruby / Rails has an amazing community and great web stack, but given my own lack of familiarity and less work being done in machine learning, it didn’t make my list.
Then I started evaluating open source projects that would be the platform. There are 132 on the list below (looked at least 4x that many). It’s been amazing getting up to speed on the projects that are open source. Although Google, IBM, Amazon and others are clearly going to lead in the machine learning space for the foreseeable future, the open source community is catching up.
Open source is a moving target, and there’s no one size fits all when you are piecing together something new. So, I’ve been using the awesome ZeroMQ library to connect services between libraries, languages.
Finally, thanks to everyone who has provided feedback so far. Can’t wait to get what I’m working on out into the world.
More about HAI
For a few weeks I’ve been having meetings with advisors and colleagues. For those I’ve not been meeting, I’ve fallen behind in communicating what I’m working on.
Not a stealth startup, but there’s also a lot that’s yet to be determined. I’d prefer to be open, but there are some specifications that I’ll keep under wraps for a variety of reasons…. When building a stealth aircraft, at the least you can tell people that you are building an aircraft. Skunkworks doesn’t make sandwiches.
Company vision and culture will be in large part determined by cofounders. Here’s where there’s some definition:
- HAI means AAI (Artificial Artificial Intelligence) – humans intelligence built into a process that’s usable by computer intelligence
- Ethical prime directives. See Friendly AI
- a sustainable business model early on
- Company culture of sustainable innovation modeled after Google’s large revenue generating platform supporting R&D.
- large, very talented, diverse founding team. Diversity is a no-brainer. Large is about five people; I’d rather create value from equity by distributing to founders than funders.
- Boston still has untapped talent and potential. Even if developers can find jobs easily, what kind of job would an engineer want for the rest of their life?
Oh. HAI.
HAI is the new company I’m working on. Human Assisted Intelligence. HAI will help computers learn about people. I’m so excited to be starting a new venture.
The future is happening right now. How far in the future will a product be relevant if you start developing it today?
Computer software / hardware outperforms humans in many specialized tasks today, and will likely surpass humans in categories reserved for our most revered public figures (scientists, politicians, performers) within 10-40 years. After the Singularity, quoting the WikiPedia main article -“Since the capabilities of such intelligence would be difficult for an unaided human mind to comprehend, the technological singularity is seen as an occurrence beyond which events cannot be predicted.” Where will humans fit into this future? A question for science fiction, perhaps.
Before the Singularity, what will help us engage effectively with a world increasing its complexity, knowledge, and economic dynamics exponentially? If super computer intelligent systems are used only by the most powerful institutions, what kind of intelligent service represents the individual?
I think these are the most interesting, important challenges around AI. Today, I’m building a team and product prototypes. If you’re interested in collaborating, feel free reach out.
Earning leadership in parenthood and business.
As a leader, control leaves your hands in one of two ways: by consent, or not by consent. Either way it happens. Choose consent.
Maybe this is the opposite of the Steve Jobs. I really don’t know, but feels like Jobs had a great sense of timing, how far he could fight for what he believed, and when to “give in”.
Perhaps there is no way to stop people from putting beans up their nose. Having insight ignored is painful if you care at all about what you are doing. If the fight is too extended, no matter if right or wrong, you end up appearing arrogant or stubborn, but if you are right it doesn’t matter. And if a company needs focus and either way is right, it doesn’t matter which is chosen, only that something is chosen and committed to.
I read Herman Hesse’s “Journey to the East” twenty years ago. Summary: leadership is service. As a new parent, I think this is especially true.
Startups are built with both Undone and Unsung Engineering
- protect the data (files/database, etc)
- back it up
- make sure data is valid (ideally some smoke tests, unit tests, regression tests on the data layer of an application)
- back it up
- source control with easy to understand policies and branching
- back it up
- automate backing it up
- fix problems with the user experiences
- small visual problems can make people … disgusted. Fonts, cross browser issues take a lot of time, but matter to people who expect things to work.??
- client (browser) speed – a small javascript issue can cause a page load to appear to take 5 seconds. Perceived performance is all that matters.??
- fix display issues – if they think data is lost, then it is??
- integrated QA team – make everyone on the team test the site, but not their own work.
- make coding fast
- establish team communication that makes product iteration rapid
- use frameworks, existing but stable libraries (open source so you can fix critical problems yourself)
- keep version control and deployment as automated as possible for the whole team
- write easy to understand code
- document code and best practices (wiki style)
- track problems (bugs and backlogs)
- write test code (on critical areas)
- housekeeping
- refactor
- cleanup
- more test code
- new features/ scaling/performance??- probably the most fun for an engineer, but last on the list. This has been the hardest for me. ??Engineers need to think about scaling when the system is architected…. maybe. But when you have just a 10 customers and the entire database fits in memory… why worry?
- database ??- a typical bottleneck. NoSQL and distributed architectures are making this easier
- separate web server from application layer. Concentrate core application logic into languages that are better suited for scaling and concurrency issues
- load balance, messaging queues, etc
The divorce: Adobe Vs. Apple
Apple and Adobe are both claiming a moral high-ground.
At first Adobe was the bad-guy for allegedly blocking further development of open standards like HTML5 and not opening up Flash.
Open source, open platforms, and open standards are really the only sustainable way forward.
I say this not to take sides. As developers and creatives we must make choices that protect our future. Protect does not mean “defend”. Protect here means “ensures sustainable happiness for all involved”. Our destinies are already intertwined, but intertwined should not feel like a lock-in, trap or prison. Freedom can co-exist with interdependence.
Thank you XEROX (for not Patenting the GUI)
Dear XEROX (PARC),
Thank you for not patenting the Graphic User Interface. Conventional wisdom has been that XEROX missed out on a huge opportunity – and maybe that’s true. However, maybe Microsoft and Apple (and Linux for that matter) would have never been such great successes. Maybe we would all have had to wait for the Newton or the IPad before the innovation could have been surpassed.Windows 7 was not my idea.
Soliciting Advice: highly concurrent, available, non-blocking server
- Easy to use (build, deploy, monitor)
- Plentiful external, stable, pre-integrated Libraries
- Use case: distributed, non-blocking web services
- Quite a bit of message and job queueing
- multiple databases , caching
- Scala
- pros
- assume it will take great advantage of Java 1.7 concurrency, multi-core
- fairly trivial import of external java libs (except for non-thread safe?)
- e.g. ZooKeeper – http://hadoop.apache.org/zookeeper/docs/r3.0.0/index.html
- cons
- new language syntax, paradigm learning curve
- doubts about JVM memory efficiency and stability as resources are constrained
- pros
- Server Side Javascript – via node.js
- pros
- redis integration for caching
- fast, lightweight, easy language.
- some custom js would be portable to browsers (coolness)
- cons
- very new
- performance
- not as many external libs?
- pros
- Tornado
- pros
- Python
- as many libs as Scala
- Python
- cons
- narrower use cases
- performance
- pros
Field Collapsing in SOLR
wget https://issues.apache.org/jira/secure/attachment/12440108/SOLR-236-trunk.patch
patch -p 1 -i SOLR-236-trunk.patch